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ABSTRACT 

Weak lensing surveys provide a powerful probe of dark energy through the measure- 
ment of the mass distribution of the local Universe. A number of ground-based and 
space-based surveys are being planned for this purpose. Here, we study the optimal 
strategy for these future surveys using the joint constraints on the equation of state 
parameter w n and its evolution w a as a figure of merit by considering power spec- 
trum tomography. For this purpose, we first consider an 'ideal' survey which is both 
wide and deep and exempt from systematics. We find that such a survey has great 
potential for dark energy studies, reaching one sigma precisions of 1% and 10% on the 
two parameters respectively. We then study the relative impact of various limitations 
by degrading this ideal survey. In particular, we consider the effect of sky coverage, 
survey depth, shape measurements systematics, photometric redshifts systematics and 
uncertainties in the non-linear power spectrum predictions. We find that, for a given 
observing time, it is always advantageous to choose a wide rather than a deep sur- 
vey geometry. We also find that the dark energy constraints from power spectrum 
tomography are robust to photometric redshift errors and catastrophic failures, if a 
spectroscopic calibration sample of 10 4 — 10 5 galaxies is available. The impact of these 
systematics is small compared to the limitations that come from potential uncertain- 
ties in the power spectrum, due to shear measurement and theoretical errors. To help 
the planning of future surveys, we summarize our results with comprehensive scaling 
relations which avoid the need for full Fisher matrix calculations. 
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1 INTRODUCTION 

Over the past decade, cosmic shear measurements have 
proved to be powerful tools for probing the properties of the 
observable Universe. Current cosmic shear measurements 
typically cover areas o f a few to a few tens of s quare degrees 
1 Massev et all 12004s] ; [Heymans ct al. 2004| iRhodes et all 
12004 iHoekstra et al.ll2006l : ISemboloni et alj l2006h and are 
able to measure the amplitude of the matter power-spectrum 
to an accuracy of ~ 10%. For these surveys, the dom- 
inant uncertainties come from the statistical errors (cos- 
mic variance) arising from the limited survey area coverage 
and observational systematics, such as uncertainties in the 
Point Spread Function (PSF). Within the next decade, a 
number of surveys have been proposed (such as DUNIjQ, 



PanSTARRS3, DE33, VISTA0, SNAffl and LSST0) that 
will observe several thousand square degrees with the aim of 
constraining the matter power-spectrum with much greater 
accuracy. Survey areas of this size overcome many of the 
problems of statistical uncertainties, so the immediate chal- 
lenge facing these surveys is to develop techniques for con- 
trolling systematic errors. 

One of the major goals of future large area cosmic 
shear surveys is to understand the nature of dark energy 
by placing tight constraints on the equations of state pa- 
rameter, w, which is the ratio of pressure to the density 
of the dark energy. These future lensing surveys should be 
able to place constraints on both the overall value of the 
equation of state and also its evolution with time by using 
the redshift information of the lensed galaxies. However, un- 
derstanding the impact of systematic errors is still in need 
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of a great deal of study. In this paper, we investigate how 
future weak lensing surveys can be optimised to yield the 
strongest constraints on dark energy. For this purpose, we 
make use of the figure of merit (FOM) proposed by the Dark 
Energy Task Force (detfJI to compare the strength of fu- 
ture dark energy surveys with different cosmological probes. 
For our optimisation study, we will only consider weak lens- 
ing power spectrum tomography, while other weak lensing 
statistics, such as the bispectrum and cross-correlation cos- 
mography, have been shown to provide a dditional cosmolog- 
ical constraints Jjain and Tavlorl 120031 ; iBernstein and Jainl 
120041 : iTavlor et alJl200d ). Although these other techniques 
could provide extra information and may require a different 
optimisation, we choose lensing tomography for optimisa- 
tion since it is the most tested and robust technique and as 
a single method offers the strongest constraints. 

We thus investigate the impact of three distinct poten- 
tial limitations of future surveys: (i) the statistical error im- 
posed by a specific survey geometry; (ii) photo-z errors; and 
(iii) uncertainties in the power-spectrum arising from shear 
measurement systematics and theoretical uncertainties. We 
assume that cosmic variance errors are Gaussian, which does 
not hold on small scales. The impact of full non-Gaussian 
errors still needs to be quantified further. 

This paper is organised as follows. In section 2, we set 
out the background theory for our dark energy models, weak 
lensing tomography and Fisher matrix formalism. In section 
3, we set out the properties of our benchmark surveys. The 
impact of limitations, such as finite observing time and sys- 
tematics errors, are explored in section 4 and discussed in 
section 5. We conclude and summarise our work in section 
6. Finally, appendix A describes our model for incorporating 
photometric redshift errors. 



2 THEORY 



straightforward method is to calculate the FOM using the 
square root of inverse of the determinant of the covariance 
matrix containing the dark energy parameter (w a and w n )- 
Our decision to use the FOM as a measure of qual- 
ity of future survey is driven by the fact that we believe 
future surveys should be judged on their ability to reduce 
the error bars on a two pa ramter w model. Other possi- 
ble measures of quality exist . lAlbrecht and Bernstein! (2006) 
show that future lensing surveys should have small errors, 
i.e. smaller than unity, for a w model with 9 eigen values. 
Since any one of these eigen values could be used as a di- 
agnostic tool to test ACDM, one could argue that all 9 el- 
ements should be included in the measure of quality of fu- 
ture surveys. This 9D-FOM could be expected to scale as 
the 9D - FOM oc FOM 9/2 . An alternative approach is to 
optimise directly using the errors on each w parameter. For 
instance, one could optimise using the errors on a constant 
w model (as show in figure [2} . A figure of merit constructed 
from the errors on one w parameter, 1D-FOM, would be 
expected to scale as the root of the FOM used in this pa- 
per (ID - FOM oc FOM 1/2 ). It should be noted that the 
FOM itself is not an error estimate for a given parameter. 
Instead it is an attempt to form a metric that quantifies the 
'goodness' of a survey. As we have illustrated this depends 
on both the number of parameters that a survey is able to 
constrain (i.e. errors smaller than unity) and the errors on 
those parameters. Clearly if one is interested in the errors 
on an individual dark energy paramter one would need to 
marginalise over all other parameters. These errors would 
then tend to scale in a similar way to the 1D-FOM. The 
exact choice of FOM will only enhance the relative merit 
of one survey over another and will not change which one 
is prefered. The DETF report uses a two parameter figure 
of merit, which is becoming the reference in the commu- 
nity. We, therefore, have chosen to stick consistently to the 
2 parameter FOM described above. 



2.1 Dark Energy model 

The impact of dark energy on gravitational lensing has 2 - 2 Weak lensing tomography 



been widely studied llHu and Tegm ark 



Jain and Tavlorl [2003!; ISong and Knox! 12004 



1999: 



Hutererl 



Ishak 



2002; 



2005; 



Ma et alJ 120061 : iTavlor et al.l 120061 ). with the effect de 



pending on the exact nature of the dark energy consid- 
ered. In this work, we consider the usual parametrisation 
|Chevallier and PolarskHl200ll : iLinderlliooH ) of evolution of 
the dark energy equation of state parameter, the w = p/p, 
i.e. 



w(a) =w n + {a„ — a)w a , 



(1) 



where a = (1 + z) _1 is the scale factor and a„ t he point a t 
which the taylor expansion of w(a) is performed (|Hu 2005). 

Since we wish to compare a variety of methods and con- 
figurations, a standard measure of the quality of a survey 
must be agreed upon. To this end, the FOM has emerged 
as a useful standard. The Figure is proportional to the in- 
verse of the area of the 2a ellipses in the w a — w n plane. 
This quantity can be calculated either by setting a n to be 
the 'pivot' point (i.e. the points where w n and w a are un- 
corrected), in which case FOM = l/(4Aw n Aw a )- A more 
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To evaluate the accuracy of weak lensing surveys, we use 
the power spectrum tomography formalism described by 
iHu and Jain! |2004). In this formalism, the background 
lensed galaxies are divided into redshift slices. The power 
spectrum corresponding to the correlations of shears, both 
within the slices and between slices, is then used to con- 
strain cosmology models. We ignore the additional infor- 
mation provided by galaxy counts, such as the galaxy-shear 
correlation fun ction. Our cosmologi cal models are calculated 
using a BBKS (|Bardeen et al.lll986l ) transfer function to cal- 
culate the linear power spectrum and iPeacock and Doddsl 
(1996) for the non-linear correction. The 3D power spec- 
trum is projected into a 2D lensing correlation function us- 
ing Limbers equation. 



2.3 Fisher Matrix 

The Fisher matrix for the shear power spectrum tomography 
is given by l|Hu and Jainl I2004T ) 

F afi = / sky J2 ( 2< + 1)A ' Tr[A a gr 1 A /3 gr 1 ], (2) 

I 



Optimal Surveys for Weak Lensing Tomography 3 



where the sum is over bands of multipole I of width Al, Tr 
stands for the trace, and / s k y is the fraction of sky covered 
by the survey. The observed shear power spectra, 

C»=C^+N^, (3) 

for each pair i, j of redshift bins are written as a sum of the 
lensing and noise power spectra, respectively. The derivative 
matrices are given by 



[D la ] 



(4) 



where p a is the vector of the model parameters considered. 

In this study, we use a 7 parameter fiducial flat cosmo- 
logical model p a = [Q m = 0.28, w n = —0.95, w a = 0.00, h = 
0.72, cr 8 = 1-0, n b = 0.046, n = 1.0]. When we determine the 
errors we expect on the w n and w a , we marginalise over the 
other 5 parameters without any external priors. 



3 BENCHMARK SURVEYS 

For our optimisation analysis, we consider several bench- 
mark surveys which we will use as references to study the 
impact of potential limiting factors and systematics. The 
first benchmark survey, which we call the 'ideal' survey that 
is both wide and deep. This is chosen to be ultra-wide, i.e. 
to cover the half of the sky that is not obscured by the 
Milky Way corresponding to a coverage of 20,000 square de- 
grees like DUNE. The depth of this ide al survey correspond 
to the depth of the S NAP wide survey l|Massev et al.ll2004bl; 
iRefregier et al ] |2004h . Such a survey can be expected to have 
100 galaxies per amin 2 with a median redshift of z m = 1.23. 
The survey is assumed to have no shape measurement sys- 
tematics errors and no error imposed on the predictions from 
theory and no photometric redshift errors. This survey can 
be considered to be ideal in the sense that, using a space- 
based like instrument, such as SNAP, it would take roughly 
thirty years to collect this data. For our benchmark calcula- 
tions, we also use the the scales in the range 10 < £ < 2 x 10 4 . 
We calculate the FOM for this survey, using cosmic shear 
tomography by dividing the galaxies into 10 redshift bins, 
and find FOM = 220. 

For comparison, we explore the effect of deviations from 
this 'ideal'. The first deviation consists of a maximal survey 
covering the 20,000 square degrees to a depth achieved by 
the SNAP deep survey (260 gal/arcmin 2 ). This is a truly 
extreme case, since this geometry requires three and half 
thousand years of observation time with the SNAP satel- 
lite. We also investigate a 'shallow' survey which could be 
achieved within 3 years and would cover 20,000 square de- 
grees to a depth of z m = 0.9, which is close to DUNE and 
LSST. Table Q] summarises the properties of these bench- 
mark surveys. Figure [1] contains our results showing the 1 
sigma errors ellipses on the w n — w a plane by marginalizing 
over the other 5 parameters. 



4 IMPACT OF LIMITATIONS 

Having appreciated the potentials of an ideal lensing survey, 
we examine how external limitations and systematic errors 
diminish the accuracy of future lensing surveys. 



Survey 


Area 


n g 




FOM 




sq. deg. 


gals/amin 2 






Shallow 


20 000 


35 


0.9 


50 


Ideal 


20 000 


100 


1.23 


220 


Maximal 


20 000 


260 


1.43 


600 



Table 1. Properties of the benchmark surveys. Also shown is the 
figure of merit for each of the configurations. 
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Figure 1. Dark energy constraints for a number of possible ideal 
surveys. The ideal survey covers half the sky (20,000 sq. degrees) 
with a median redshift of z m = 1.23 and 100 galaxies per amin 2 . 
Also shown is a maximal ideal survey with z m = 1.43 and 260 
galaxies per amin 2 . Finally, a shallow survey is shown that has 



35 galaxies per amin and z,. 



0.9 



4.1 Finite Observing Time 

The first limitation is that any future survey will be limited 
by a finite observing time. Within a realistic timeframe, of 
roughly 3 years, the ideal survey will be degraded in either 
depth or survey area. Optimizing such a survey, therefore, 
needs a trade-off between these two competing parameters. 
There are three quantities that are important when consid- 
ering survey geometry. These are the area of the survey, A s ; 
the number density of lensed galaxies, n 9 ; and the redshift 
distribution of the galaxies, which is well characterized by 
the median redshift of the distribution, z m . 

Figure [2] shows the impact of each of these parame- 
ters on the FOM. Looking at each component separately, 
we find, by varying each parameter of the survey while fix- 
ing all the other parameters, that the figure of merit scales 
linearly with area (FOM tx A]- ). The scaling is also al- 
most linear with galaxy density counts (FOM <x ng 10 ) 
and has a stronger dependence on the median redshift 
(FOM oc z"). These dependencies need to be cast in terms 
of observation time, which can only be fully evaluated using 
full image simulations fo r a survey. These hav e been per- 
formed for both DUNE |Refregier et all l2006h and SNAP 
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l|Massev et al.|[2004bl ). To estimate t he dependency, we us e 
the survey times quoted in table 1 of lRefregier et all l|2004h . 
which are the results of SNAP simulations. We find that 
z m oc to b ° s 67 and ng oc t°^f. Combining these effects, figure 
[3] shows the fractional change in FOM as a function of the 
fraction of time dedicated to increasing z m , ng and A s . We 
see that the dominant property of a survey is its area. Even 
when combining the benefits of a deep survey, i.e. higher z m 
and ng, we see that A s still dominates. 

Finally, we perform a full deep vs wide trade-off study. 
Figure [4] shows the FOM for a survey limited to a mission 
time of three years with 60% observing efficiency. Under 
these restrictions, the survey can either be wide, at the ex- 
pense of depth or vice versa. In order to correctly scale A s , 
n g and z m we continu e to use the space image simulation re- 
sults of iMassev et al.l <|2004bh . We have also confirmed that 
this scaling is consi stent with the scaling found in the simu- 
lations described in Rcfrcgicr et all (120061 ). We see that once 
again a maximum FOM is achieved by the widest possible 
configuration, which is 20,000 square degrees. 



4.2 Photometric Redshifts 

For weak lensing tomography, the redshift distribution of the 
galaxies in each redhsift bin must be known. Spectroscopi- 
cally measuring the redshifts of the galaxies over the entire 
area of future lensing surveys is not feasible. Hence, the next 
generation of lensing surveys will rely heavily on photomet- 
ric redshifts, based on multi-color photometry. The accuracy 
of this method depends on a number of factors including: I) 
the accuracy of the photometry; 2) the number of bands; 3) 
the covered wavelength range; and 4) the number of spec- 
troscopic redshift used for calibration. These factors, along 
with the details of the specific mock or photo-z method used, 
impact the photometric errors in complicated ways. Instead 
of tackling these details directly, we choose to model their 
impact in a more generic and simple way here. However, 
a more detailed analysis of specific imaging surveys, such 
as DES, Panstarrs and LSST, is in progress and will follow 
shortly (Abdalla et al in prep). 

Here we consider 3 distinct effects: statistical disper- 
sion, catastrophic failures and calibration errors for a given 
number of available spectroscopic redshifts. We model the 
statistical dispersion in the measured redshift with a Gaus- 
sian that has a standard deviation of 8 S . This is illustrated 
by the red curve in figure [5] The catastrophic failures are 
modeled by adding extra Gaussians to the PDF of the galax- 
ies, as shown by the the blue curve in figure [5] where the 
fraction of galaxies that suffer catastrophic failures is feat- 
Finally we use the number of galaxy spectra, n s , to assess 
how well the mean and the variance of the redshift slices can 
be measured and hence calibrated. The details of the model 
we use are described in detail in appendix lAl 

Figure [S] shows the impact of these effects on the red- 
shift slices. For our surveys, we assum e that galaxies ar e 
distributed according to a PDF given bv lSmail et all l|l994h . 
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where we set a = 2 and f3 = 1.5. The median redshift of the 
survey, z m , is then used to set zq — z m j\AYl. 



Figure 2. Effect of survey geometry on the DE FOM. For each 
of the plots shown here only one survey property is varied at any 
time with others held fixed. Top: Figure of merit as a function of 
survey area. The symbols show the results of a Fisher calculation 
and the line shows a linear fit to the data. Middle: Figure of merit 
as a function of galaxy number density. The redshift distribution 
is the same for all the points with a median redshift of 1.23. 
Bottom: Dependence of the figure of merit on the median redshift 
of the lensed galaxies. The number density of galaxies has been 
fixed to 100 galaxies per square arc minute. 
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Figure 3. Gains in FOM when time is dedicated to increasing 
one of the three parameters which impact the statistics of cosmic 
shear. We see that devoting observing time to increasing the area 
of the survey has the greatest impact on the FOM, while the 
change in median redshift lensed galaxies causes a minimal change 
in FOM. When performing a deep vs. wide trade-off study, these 
three factors fall into two groups. Increasing the area requires 
observing time being spent going wide, while the other two factors 
prefer a deep survey. Taking this into account, we see that the 
gains from increasing area out-weigh the combined gains of zm 
and ng. 

Figure [6] shows the change in figure of merit as a func- 
tion of S z . We see a clear degradation of the FOM with 
increasing S z . We find that for our ideal survey, the figure 
of merit scales as, FOM oc 10 -1 - 644 * . We also find that for 
a shallower survey, with z m = 0.9 and ng = 35, the figure 
of merit also shows a drop with S z , (FOM oc 10~ 169Sz ). In- 
vestigating the impact of catastrophic failures, we also find 
a decrease in the figure of merit for an increase in the catas- 
trophic failure fraction, / cat , (figure [7J . For the ideal sur- 
vey, we find that FOM oc lO" - 75 ^' . We also find that for 
a survey with the same geometry as our ideal survey but 
with S z = 0.1, FOM oc lO" ' 94 ^ , the shallow survey with 
8 Z = 0.01 has a FOM oc l0-°- 93/cat , and finally that a shal- 
low survey with 5 Z = 0.1 has FOM oc I0~ 11/cat . 

Next we show the requirements on the calibration sam- 
ple, namely how the FOM depends on the number of spec- 
troscopic redshifts available, n s . Figure [8] shows the results 
for four cases, (i) our ideal survey, (ii) our ideal survey with 
5 Z = 0.1, (iii) our shallow survey with S z = 0.01, and (iv) 
our shallow survey with S z — 0.1. These values of S z have 
been chosen to look at the difference one would expect from 
a good photo-z survey (5 Z — 0.01) and a more modest sur- 
vey (<5 Z = 0.1). For each case we show the calculations for 
two scenarios: in the first we marginalize over both the mean 
and the variance, and in the second we marginalize only over 
the mean and fix the variance. From our results we see that 
uncertainties in these quantities (mean and variance) are 
important and can substantially reduce the sensitivity of a 
survey, although it is interesting to note that even with a 
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0.01 t H 

100 1000 10000 
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Figure 4. The results of a deep vs. wide trade-off study, given 3 
years of observing time. The survey area, galaxy number counts 
and their median redshift are calculated by int erpolating and ex- 
trapolating the results of lMassev et al . (2004b). The three quan- 
tities are strongly correlated. Hence, a wide survey will have a 
lower galaxy number density and median redshift than a survey 
covering a small area. The upper panel shows the optimisation 
using the FOM, quantifying the error levels on a 2 parameter w 
model, and the lower panel shows the errors on the equation of 
state for a constant w model (i.e. a 1 parameter w model). As 
discussed in section [2. II improvement as measured by the FOM 
is greater than the improvement from a 1 parameter w model. 

small number of calibration galaxies, weak lensing tomogra- 
phy is able to provide good self-calibration. For large number 
of galaxies in the range 10 4 — 10 5 , the pho tometric calibra - 
tion is robust, which is in agreement with iMa et al] (|2006l ) 
who also find that they need this many galaxy spectra for 
calibration. We see that in this region where very few galaxy 
spectra are available, uncertainties in both the mean and the 
variance play an important role. However, if the number of 
calibration galaxies exceeds 10 4 , only the uncertainty in the 
mean is important. This suggests that if we do investigate 
higher order moments, they should only be important when 
the number of galaxy spectra is small. We can expect that 
above 10 4 galaxy spectra, the uncertainty in the mean will 
continue to dominate. 

4.3 Shear Measurement Systematics and 
Theoretical Uncertainties 

The final sources of error we consider are those associ- 
ated with the lensing power-spectrum itself. These errors 
could have a number of origins, ranging from residual galaxy 
shape correlations arising from imperfect PSF deconvolu- 
tion to uncertainties in the theoretical predictions. Due to 
the potential complexity and unknown nature of this er- 
ror, we consider a simple phenomenological error model for 
the power spectrum. Specifically, we consider a systematic 
uncertainty in all the power-spectra (auto-correlations and 
cross-correlations) of the form, 
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Figure 5. The upper panel shows the redshift distribution of 
galaxies, with z m = 0.9, divided into 10 redshift bins. The pho- 
tometric redshifts are assumed to have <5 Z = 0.01 and no catas- 
trophic failures. The lower panel shows the distribution of the 
galaxies in the 8th redshift bin. The red curve shows this red- 
shift slice for 8 Z = 0.1, and the blue curve also has S z = 0.1 and 
/cat = 0.3. 



Figure 7. Impact of catastrophic failures of photometric redshifts 
on the figure of merit for four survey configurations. The results 
in black are for surveys with 8 Z = 0.01, and the red results are for 
surveys with S z = 0.1. The crosses show the Fisher matrix results 
of surveys that have the same geometry as our ideal survey. The 
triangles show the results for a shallower survey, with z m = 0.9 
and ng = 35. The lines show a linear fit. 
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Figure 6. Impact of statistical errors in photometric redshift on 
the figure of merit parameter. In black, we see the impact on the 
ideal survey, and in red we see the results for a low redshift survey 
(z„, = 0.9). The symbols show the results of the Fisher matrix 
analysis, and the lines show a linear fit to the data to highlight 
the trends. 



Figure 8. Importance of spectroscopic redshift measurements 
for cosmic shear tomography. Four cases are shown: (i) our ideal 
survey -black solid curves; (ii) our ideal survey with 8 Z = 0.1 - 
black dashed curves; (iii) our shallow survey with 8 Z = 0.01 - 
red solid curves; and (iv) our shallow survey with 8 Z = 0.1 - red 
dashed curves. For each case, two curves are shown. The bottom 
curves correspond to marginalizing over both mean and variance 
of the distribution, while the top curves are the results when we 
marginalize only over the mean and fix the variance. 
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ACV lSys = ACi a 



la 



(6) 



where ACi a controls the amplitude of the systematic error 
at a reference multipole i . 

We add this error to the statistical errors by creating a 
covariance matrix with the same dimensions as that of the 
statistical covariance matrix. We then fill the diagonal ele- 
ments of this systematic covariance matrix with the square 
of the error shown in equation [6] and add this matrix to the 
statistical covariance matrix. This, therefore, assumes that 
these systematic errors are uncorrelated. 

Figure[9]shows the results of adding these systematics to 
both the ideal survey and the shallow survey. We find that, 
if we choose £ — 700, the impact of the systematic errors is 
largely independent of f3 over a reasonably large range ( — 2 < 
f3 < 1). This suggest that it is the level of the errors at t ~ 
700 that dominate. The exception to this is /3 = — 1, where 
the level of the error does not affect the figure of merit until 
AC io ~ 10" 11 for the ideal survey and AC lo ~ 10~ 10 for the 
shallow survey. This is because for /3 = — 1 the systematic 
errors scale roughly the same way as the statistical errors 
(including cosmic variance) on these scale. This means that 
the systematic errors are subdominant until they reach the 
same level as the statistical errors. Beyond this point, the 
systematics errors dominate on all scales and, hence, have a 
large impact on the FOM. We find that the scaling for the 
ideal and the shallow survey are well described by 



FOM oc AC e> 
and 



FOM <x AC, 



V fif +o.i I 



V 0.1 / 



(7) 



(8) 



respectively, where /3j = f3+l. This is only valid in the range 
2 x 10~ 13 < G lo < 2 x 10~ 10 , with (3 / -1. For (3 = -1, 
this is only valid in the ideal case for ACe a < 5 x 10~ n and 
for the shallow survey for ACe a < 10 -10 . 

Another source of concern is the uncertainty of the the- 
oretical predictions of the power spectrum on small scales. 
In particular, the role played by baryons is not well under- 
stood at present (|Zhan and Knoxl [20o3 ; Ijing et all [2006). 
One simple way to circumvent this is to only fit within a 
scale range where the theoretical model is more secure. As 
explained in section [3] all the calculations desribed above 
assumed the range 10 < £ < 2 x 10 4 . Fi gure [10] shows the 
FOM as a function of £ m ax, where the analysis has been 
performed in the range 10 < £ < £ ma x- We see the i ma x 
has a significant impact on the precision of the survey. We 
find that for the ideal case FOM oc CnL but for a shallow 
survey with 5 Z =0.1 we find FOM oc 
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5 DISCUSSION 

In the preceding sections we have calculated the dependence 
of the FOM on a number of limiting parameters pertaining 
to survey properties and systematic effects. For this pur- 
pose, we considered deviations from set benchmark surveys, 
namely an ideal survey, which is both wide and deep, and a 
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Figure 9. Impact of systematic errors in the lensing power- 
spectra on the FOM. The top panel shows the results for our ideal 
survey, while the lower panel shows those for the shallow survey 
(with ng=35, z m =0.9 and <5 Z =0.1). For each panel we show the 
FOM against AC^ Q , the value of the systematic error at I = 700 
for a number of slopes, fi. We see that for I = 700, the impact 
of the error are largely in agreement up to ~ 10 -10 . This is not 
true for j3 = — 1, which is the slope of the statistical error. For 
this case the FOM is not affect by this form of error until they 
reach the same level as the statistical errors. 
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Figure 10. Figure of merit as a function of the maximum multi- 
pole i max used for the tomographic measurement. The minimum 
multipole is kept constant at Z m i n = 10. The upper points (black 
crosses) are for the ideal survey, and the lower points (red trian- 
gles) are for a shallow survey with z m = 0.9, n g = 35 amin - 2 
and &z = 0.1. 
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shallow survey, which is wide. The results are summarised 
by Equations [9] and [TO] which give global analytic fits to the 
scaling of the FOM about the two respective surveys. These 
equations are convenient for the planning of future surveys, 
as it provides a first order optimization without needing to 
resort to the full Fisher matrix analysis. 

We have also performed a test to verify that the two 
expressions are consistent with each other. Figure [TT1 shows 
the comparison between our two fitting functions (equation 
[9land [T0| . We use the parameter 7 to interpolate z m , n g and 
5 Z between the central models in equation [9] and equation 
I lOt where n s has been set high enough that we reach the 
assymptotic values shown in figure [8] Noting that 7 = 
corresponds to the values in equation [10] and that 7 = 1 
corresponds to equation [9] we see that, as expected, each fit 
work well close to point where it is supposed to. Also shown 
is a weighted average (FOM avc = jFOMi + (l-^)FOM 2 ), 
which fits remarkably well over the entire range. 

A further useful analysis is to look at the potential 
improvement in the FOM by considering a realistic range 
of survey parameters. This is shown in Table [2] For A s , 
and z m , the low er bounds are set by the CFHTLS 



l|Hoekstra et al.l [20061 ) . while the upper bounds are set by 
planned future missions, specifically, DUNE for the area and 
SNAP wide for the depth and number counts. Next, we con- 
sider realistic ranges for the photometric redshift uncertain- 
ties. For this, we assume that current photometric redshifts 
have a statistical rms error of 10%/(1 + z) with 10% catas- 
trophic failures and can be calibrated using roughly 1000 
spectroscopic measurements. To illustrate the room for im- 
provement, we assume that future surveys will have a cali- 
bration set made up of 10 5 spectroscopic galaxies, a statis- 
tical error of 1%/(1 + z) and no catastrophic failures. This 
should be possible with a sufficient filter set that covers both 
the visible and the near infrared (NIR). 

For the ^-mode cut off, we consider the range 10 3 < 
imax < 10 5 , which is the range that is typically considered 
in theoretical analyses and corresponds to discarding non- 
linear modes (lower bound) to including subarcmin scales 
(upper bound). Finally, we consider the expected range of 
measurements and theoretical systematic errors. For the 
lower bounds in Table [2] we consider recent measurements 
of erg. These results show a scatter at the 10% level, which 
is due to a combination of both statistical and systematic 
errors. In these measurements, errors at typically quoted at 
the 5% level (1<t) and include statistical errors and system- 
atics, which would suggest that the remaining unaccounted 
for systematic errors are at the 8% level. Given that there is 
currently 5% uncertainty on the theoretical power-spectrum, 
we estimate that the total systematic errors on current as 
(or shear) measurements are at the 10% level. This, in turn, 
translates into a 20% uncertainty in the shear power spec- 
trum Ce . Future lensing surveys have se t 0.1% as their target 
uncertainties on shear measurements |Huterer et al. 2006; 
Ivan Waerbeke et all 120061 ; iRefregier et al.l |2006h . which 
translates into an uncertainty of 0.2% in Ce- These system- 
atic uncertainties can be converted into our ACe„ through 
the values of Ce at I — 700, which we find to be ~ 10 -9 on 
average for the 'ideal' configurations we have considered in 
this work. 

We use equation [9] to estimate the impact on the FOM 
from the ranges shown in table[2] We note, however, that the 



impact of the measurement and theory systematics part of 
the error budget is difficult to quantify accurately, since its 
extent will depend on the exact form of the uncertainty. To 
obtain the trends shown in equations l9l and [TOl we have only 
considered uncorrelated errors, assuming a simple power-law 
form. We, therefore, consider this to be the lower bound 
of the impact. More work is needed to further refine these 
estimates. 

From table [5] we see that of three properties of a lens- 
ing survey, A s , n g and z m , it is from the increased area 
that future surveys will most significantly improve over cur- 
rent measurements. The impact of the other two properties 
falling far behind. It is clear, therefore, that future surveys 
should first aim to increase area, then increase the num- 
ber of galaxies and finally increase the median redshift of 
the lensed galaxies. From the table, we also see that the 
greatest gains in systematics can be achieved by increasing 
(■max, which is a threshold set by our uncertainties due to 
highly nonlinear interactions and, more importantly, bary- 
onic physics. We see that the other important systematic is 
the measurement/theoretical uncertainty in Ce, which will 
require: (1) improved numerical simulation of the nonlinear 
regime; (2) improve ments in shear measurement techniques 
|Massev et al.ll2006h ; and (3) tight control of image quality 
and PSF. 

Finally, we see that improvements in the photometric 
redshift errors will cause a small, but non-negligible, im- 
provement in the FOM. This improvement will come from 
two main fronts: (1) an increase in the number of galaxy 
spectra available for calibration; and (2) from reducing the 
spread of the errors, which can be achieved through better 
photometry measurements over a large filter set. We see that 
relative to the other factors that have been considered, the 
effect of catastrophic failures is small. 



6 CONCLUSION 

In this paper, we have studied how future weak lensing sur- 
veys can be optimised to maximise the constraints on dark 
energy parameters using the figure of merit proposed by the 
Dark Energy Task Force. For this purpose, we have con- 
sidered the impact on power spectrum tomography of the 
survey geometry and of various systematics, such as photo- 
metric redshift errors, shape measurement errors, and the- 
oretical uncertainties. This was done by considering devia- 
tions from benchmark surveys to study the impact of each 
of these effects. 

We found that, for a given observing time, it is always 
advantageous to choose a wide rather than deep survey ge- 
ometry. We also find that a wide shallow survey captures 
most of the dark energy information by reaching a FOM of 
about 110, which is only a factor of 5 less than an unrealistic 
maximal survey that is both wide and very deep. 

We also found that the dark energy constraints from 
power spectrum tomography are fairly robust to systemat- 
ics . In particular, w e extend and simplify the formalism used 
bv lMa et al.l l|2006h for modeling the impact of photometric 
redshift errors. Using this, we find that as long as a sub- 
sample of 10 4 — 10 5 spectra are available for calibration, 
weak lensing tomography is robust to statistical errors and 
catastrophic failures in the photometric redshifts. The intro- 
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Range 


Fractional 
change in FOM 


A s 


170 


-> 2 x 10 4 


120 


n g 


15 


-> 100 


6.7 


Zm 


0.66 


-» 1.23 


2.1 


n s 


10 3 


-> 10 5 


1.5 


&z 


0.1 


-> 0.01 


1.4 


feat 


0.1 


-> 0.0 


1.2 


^max 


10 3 


-> 10 4 


6.6 


AC^/10- 9 


0.2 


-> 0.002 


^ 2.2 



Table 2. Table showing the fractional improvement in the FOM 
for a feasible range of survey properties using equation [9] 
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Figure 11. Comparison between the fitting functions shown in 
equations[2](bottom black curve) and llOl ftop black curve). We use 
7 to interpolate between the values of FOM\ (7 = 1) and FOM2 
(7 = 0). The points show the full Fisher matrix calculations. The 
blue dashed curve shows the weighted average (■yFOMi + (1 — 
"f)FOM2). This figure also assumes that n s is sufficiently high 
in all cases so that the assumptotic values shown in figure [8] are 
reached. 



duction of statistical dispersion, bias or catastrophic failures 
of photometric redshifts indeed only affect the dark energy 
figure of merit at th e 40% leve l, whic h is consistent with 
the recent findings of I Jain et al.l ((2007) . This is small com- 
pared to the limitations that come from uncertainties in the 
power spectrum, due to measurement-theoretical systemat- 



ics or requiring a high I cutoff, which, respectively, have a 
factor of 2.2 and 3.5 impact on FOM. 

In our optimisation study, we have only considered 
power spectrum tomography. Other statistics have been pro- 
posed which can improve upon the constraints of power spec- 
trum to^nogj^ir^Tmr^ferffar, cross-correlation cosmogra- 
phy l|Bernstein and Jainll2004l ) can separate the geometrical 
effects of dark energy from its effect on the growth of struc- 
ture. Higher order statistics such as the bispectrum or peak 
statistics are also known to break degeneracies present when 
2-point functions are used. While iTavlor et ail (|2006t ) have 
shown that cross- correlation cosmography, at least when 
measured using detected haloes, also favors wide rather than 
deep surveys, it would be interesting to study whether the 
optimisation results obtained here can be generalised to 
these more advanced statistics. We have also assumed that 
our cosmic variance errors are Gaussian, which does not hold 
on small scales (10 3 < I < 10 5 ). Including the effects of 
non-Gaussian errors is being investigated by other authors, 
and current results indicate that for the wide survey ge- 
ometry studied here the impact of non-Gaussian errors is 
small (Takada et al - private communications). This, how- 
ever, needs to be quantified further. 



APPENDIX A: 
MODEL 



PHOTOMETRIC REDSHIFT 



The first uncertainty we consider is a statistical errors in 
measuring the photo-z. We therefore begin by assuming that 
the redshift measured using photometry is related to the 
true redshift, z t , through a probability distribution func- 
tion, P B tB,t{zt), that is Gaussian and centered on Zf This 
models the spread caused by photometric errors in the mea- 
surements. We are able to vary the level of the error through 
the standard deviation of P s tat, which we assume scales as 
(l+z). This leads to: 



(Pstat) = d,Z 2p st at(z) = Z t , 



CT 2 (Pstat) = J dz Z 2 Pstat (z) = 6*(1 + 



Zt 



(Al) 



(A2) 



where the level of this error is 8 Z , which is dominated by the 
quality of the photometric measurements and the number of 
filter bands used, and Zt is the true redshift of the galaxy. 
This spread is illustrated by the red curve in figure [5] 

Another source of error is introduced if the wavelength 
range covered is insufficient to correctly identify features 
in the galaxy spectrum, such as the Balmer break. This 
misidentification leads to galaxies being assigned a redshift 
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that is off-set by A z from the the true redshift. We thus con- 
struct a probability distribution function to describe these 
catastrophic failures, P ca t(zt). Since galaxies can either be 
misidentified with a redshift offset A z , that is either higher 
or lower than the true redshift, we can construct a generic 
form of Pcat(zt) to be bimodal, P ca t(zt) = P^ t (z t ) + P~ at {z t ), 
with P^ t (z t ) capturing the PDF information for the catas- 
trophic failures at redshifts greater than zt and P c ~ t (zt) cap- 
turing the information about the scatter to redshifts below 
Zt- Then, 



(-Peat) — 2 cat — Z t ± A z 



(A3) 



The value A z depends on the set of filters used and 
the spectral properties of the galaxies in the survey. For 
simplicity we assume that A z = 1 and that the uncertainty 
of catastrophic failure population is the same as that for a 
galaxy whose true redshift is z ca t, 



<T(P c tt)=<Ul+Zcat) 



(A4) 



Combining these two effects we can create the probability 
distribution function of photometric redshift given a galaxy 
with a true redshift zt, 



P{Zphot\zt) = (1 — /cat)Pstat(^t) + /cat Peat (^i), 



(A5) 



where / ca t describes the fraction of galaxies that are misiden- 
tified. An example of this is the blue curve of figure [S] Us- 
ing Bayes' theorem, this conditional probability can be con- 
verted into a full 2D PDF, 



P{zt,z phot ) = P(z p h t\Zt)P(Zt), 



(A6) 



where P(zt) is the probability distribution of true redshifts 
of the galaxies in the survey. This can be used to calculate 
the underlying distribution of galaxies after a cut is applied 
to the measured photometric redshifts. Cuts can be made 
by applying a window function, Wi(z p \ lot ) , and obtain 



Pi(Zt) 



d2photH / i(2phot)P(«phot, Z t ). 



(A7) 



This is very similar to the approach taken by iMa et all 
l|2006h . 

Thus far we have examined the impact of changing 
the shape of the PDFs of the galaxies in each redshift bin. 
This, however, does not take into account the errors intro- 
duced from the uncertainties in the shapes of the PDFs. 
This third uncertainty arises because the algorithms used 
to measure the photometric redshifts of galaxies are devel- 
oped and tested using a finite subsample of galaxies whose 
redshifts are measured spectroscopically. The first step in 
performing this calibration is to ensure that the sample of 
spectroscopically measured galaxies are a fair representation 
of the galaxies in the lensing survey, i.e. subject to the same 
selection criteria. With this done, the accuracy with which 
we are able to measure the PDF of the galaxies per red- 
shift bin depends on the number of galaxies in that redshift 
bin. The number of spectroscop ic galaxy r edshi fts that are 
needed has also been studied bv lMa et all l|2006h by assum- 
ing a 64 parameter binned bias model. 

Here, we take a slightly different approach. We first as- 
sume that the photo-z's are used to divide the distribution 
of galaxies into redshift bins (as shown in figure [5j . Since 
the bias can take any form and need not be monotonic, we 



do not try to constrain this directly in the photo-z code. In- 
stead we use the fact that the PDF in each bin will be sam- 
pled with a finite number of galaxies, and it is that which 
will determine the accuracy of the distribution in each bin. 
In principle, this approach can be used to place error bars 
on all the moments of the distribution, which can then be 
marginalized over, but for now we focus on the mean and 
the variance of the distribution. 

For each bin we introduce the mean and the variance 
of each as a free parameter, which increases the number of 
parameters in our Fisher matrix from 7 to 27. We then add 
a prior to the Fisher matrix that depends on the number 
of spectroscopic galaxies in each redshift bin. The priors on 
the mean are calculated using 



a (z) = a (z)/n s 



(A8) 



where o 2 (z) is the variance of the mean, n s is the number of 
spectrascopic galaxies in each bin. Similarly, the priors on 
the sample variance are calculated by 



a 2 (a 2 (z)) = 



H4 



(A9) 



where /i4 is the fourth order moment of the galaxy distribu- 
tion in each bin. 
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